Technical Report: GIT-CERCS-09-06 A Characterization and Analysis of GPGPU Kernels
نویسندگان
چکیده
General purpose application development for GPUs (GPGPU) has recently gained momentum as a cost-effective approach for accelerating dataand compute-intensive applications, pushed to the forefront by the introduction of Cbased programming environments such as NVIDIA’s CUDA, [1], OpenCL [2], and Intel’s Ct [3]. While significant effort has been focused on developing and evaluating applications and software tools, comparatively little has been devoted to the analysis and characterization of applications to assist future work in compiler optimizations, application re-structuring, and GPGPU micro-architecture design. This paper proposes a set of metrics for GPGPU workloads and uses these metrics to analyze the behavior of GPGPU programs. We report on an analysis of over 50 kernels and applications including the full NVIDIA CUDA SDK [4] covering control flow, data flow, parallelism and memory behavior. The analysis was performed using a full function emulator we developed that implements the NVIDIA virtual machine referred to as PTX (Parallel Thread eXecution architecture) a machine model and low level virtual ISA. The emulator can execute compiled kernels from the CUDA compiler, currently supports the full PTX 1.3 specification [5], and has been validated against the full CUDA SDK. The results quantify the importance of optimizations such as those for branch re-convergence, the prevalance of sharing between threads, and the opportunities for additional parallelism.
منابع مشابه
Technical Report GIT-CERCS-06-03 The Sleepy Keeper Approach: Methodology, Layout and Power Results for a 4-bit Adder
متن کامل
Abstracting Syntax
ing Syntax Brian Aydemir Stephanie Weirich Steve Zdancewic University of Pennsylvania {baydemir,sweirich,stevez}@cis.upenn.edu Technical Report MS-CIS-09-06
متن کاملAnalysis of convergence of solution of general fuzzy integral equation with nonlinear fuzzy kernels
Fuzzy integral equations have a major role in the mathematics and applications.In this paper, general fuzzy integral equations with nonlinear fuzzykernels are introduced. The existence and uniqueness of their solutions areapproved and an upper bound for them are determined. Finally an algorithmis drawn to show theorems better.
متن کاملبررسی تاثیر کرنل های مختلف بازسازی تصویر بر روی نویز و رزولوشن فضایی تصاویر سی تی اسکن: مطالعه بر روی فانتوم
Introduction: Noise and spatial resolution (SR) are the main characteristics of image quality in CT scanning affected by different radiation and technical parameters. One of the important parameters are reconstruction kernels. Different reconstruction kernels are used for noise reduction and/or edge-enhancement purposes. In this study, we investigate some reconstruction kern...
متن کاملTowards Multi-tenant GPGPU: Event-driven Programming Model for System-wide Scheduling on Shared GPUs
Graphics processing units (GPUs) are attractive to the generalpurpose computing (GPGPU) beyond the graphics purpose. Sharing GPUs among such GPGPU applications is a key requirement especially for cloud platforms whose resources are utilized by various cloud users. However, consolidating recent GPU applications, referred to as GPU eaters, on a GPU poses a new challenge. Such advanced application...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009